Dataset statistics
| Number of variables | 32 |
|---|---|
| Number of observations | 3334 |
| Missing cells | 977 |
| Missing cells (%) | 0.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 742.5 KiB |
| Average record size in memory | 228.0 B |
Variable types
| Numeric | 21 |
|---|---|
| Categorical | 4 |
| Text | 3 |
| Boolean | 4 |
Unnamed: 0 is highly overall correlated with osebuildingid | High correlation |
osebuildingid is highly overall correlated with Unnamed: 0 | High correlation |
councildistrictcode is highly overall correlated with latitude and 1 other fields | High correlation |
propertygfatotal is highly overall correlated with propertygfabuilding_s and 4 other fields | High correlation |
propertygfabuilding_s is highly overall correlated with propertygfatotal and 4 other fields | High correlation |
largestpropertyusetypegfa is highly overall correlated with propertygfatotal and 4 other fields | High correlation |
energystarscore is highly overall correlated with sourceeui_kbtu_sf and 2 other fields | High correlation |
siteeui_kbtu_sf is highly overall correlated with siteeuiwn_kbtu_sf and 6 other fields | High correlation |
siteeuiwn_kbtu_sf is highly overall correlated with siteeui_kbtu_sf and 5 other fields | High correlation |
sourceeui_kbtu_sf is highly overall correlated with energystarscore and 5 other fields | High correlation |
sourceeuiwn_kbtu_sf is highly overall correlated with energystarscore and 5 other fields | High correlation |
siteenergyuse_kbtu is highly overall correlated with propertygfatotal and 8 other fields | High correlation |
siteenergyusewn_kbtu is highly overall correlated with propertygfatotal and 8 other fields | High correlation |
totalghgemissions is highly overall correlated with propertygfatotal and 6 other fields | High correlation |
latitude is highly overall correlated with councildistrictcode and 1 other fields | High correlation |
buildingtype is highly overall correlated with primarypropertytype and 1 other fields | High correlation |
primarypropertytype is highly overall correlated with buildingtype and 1 other fields | High correlation |
neighborhood is highly overall correlated with councildistrictcode and 1 other fields | High correlation |
defaultdata is highly overall correlated with buildingtype and 2 other fields | High correlation |
compliancestatus is highly overall correlated with energystarscore and 2 other fields | High correlation |
steamuse is highly imbalanced (76.4%) | Imbalance |
electricity is highly imbalanced (96.1%) | Imbalance |
defaultdata is highly imbalanced (78.9%) | Imbalance |
compliancestatus is highly imbalanced (99.2%) | Imbalance |
energystarscore has 825 (24.7%) missing values | Missing |
compliancestatus has 126 (3.8%) missing values | Missing |
numberofbuildings is highly skewed (γ1 = 43.70966104) | Skewed |
propertygfatotal is highly skewed (γ1 = 24.00490414) | Skewed |
propertygfabuilding_s is highly skewed (γ1 = 27.47908747) | Skewed |
largestpropertyusetypegfa is highly skewed (γ1 = 29.97068045) | Skewed |
siteenergyuse_kbtu is highly skewed (γ1 = 24.76248203) | Skewed |
Unnamed: 0 is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
osebuildingid has unique values | Unique |
propertygfaparking has 2835 (85.0%) zeros | Zeros |
sourceeuiwn_kbtu_sf has 36 (1.1%) zeros | Zeros |
Reproduction
| Analysis started | 2023-07-04 02:07:43.764592 |
|---|---|
| Analysis finished | 2023-07-04 02:09:52.845386 |
| Duration | 2 minutes and 9.08 seconds |
| Software version | ydata-profiling vv4.3.1 |
| Download configuration | config.json |
Unnamed: 0
Real number (ℝ)
HIGH CORRELATION  UNIFORM  UNIQUE 
| Distinct | 3334 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1684.1941 |
| Minimum | 0 |
|---|---|
| Maximum | 3375 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 166.65 |
| Q1 | 842.25 |
| median | 1681.5 |
| Q3 | 2529.75 |
| 95-th percentile | 3204.35 |
| Maximum | 3375 |
| Range | 3375 |
| Interquartile range (IQR) | 1687.5 |
Descriptive statistics
| Standard deviation | 974.70541 |
|---|---|
| Coefficient of variation (CV) | 0.578737 |
| Kurtosis | -1.1991233 |
| Mean | 1684.1941 |
| Median Absolute Deviation (MAD) | 844.5 |
| Skewness | 0.0032131493 |
| Sum | 5615103 |
| Variance | 950050.64 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 2104 | 1 | < 0.1% |
| 2239 | 1 | < 0.1% |
| 2240 | 1 | < 0.1% |
| 2241 | 1 | < 0.1% |
| 2242 | 1 | < 0.1% |
| 2243 | 1 | < 0.1% |
| 2244 | 1 | < 0.1% |
| 2245 | 1 | < 0.1% |
| 2246 | 1 | < 0.1% |
| Other values (3324) | 3324 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 3375 | 1 | |
| 3374 | 1 | |
| 3373 | 1 | |
| 3372 | 1 | |
| 3371 | 1 | |
| 3370 | 1 | |
| 3369 | 1 | |
| 3368 | 1 | |
| 3367 | 1 | |
| 3366 | 1 |
osebuildingid
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 3334 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21170.645 |
| Minimum | 1 |
|---|---|
| Maximum | 50226 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 271.25 |
| Q1 | 19988.5 |
| median | 23105.5 |
| Q3 | 25988.5 |
| 95-th percentile | 49781.7 |
| Maximum | 50226 |
| Range | 50225 |
| Interquartile range (IQR) | 6000 |
Descriptive statistics
| Standard deviation | 12221.494 |
|---|---|
| Coefficient of variation (CV) | 0.5772849 |
| Kurtosis | 0.64663913 |
| Mean | 21170.645 |
| Median Absolute Deviation (MAD) | 3013 |
| Skewness | -0.0098033577 |
| Sum | 70582931 |
| Variance | 1.4936491 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 24473 | 1 | < 0.1% |
| 24906 | 1 | < 0.1% |
| 24908 | 1 | < 0.1% |
| 24909 | 1 | < 0.1% |
| 24911 | 1 | < 0.1% |
| 24921 | 1 | < 0.1% |
| 24934 | 1 | < 0.1% |
| 24943 | 1 | < 0.1% |
| 24948 | 1 | < 0.1% |
| Other values (3324) | 3324 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 13 | 1 |
| Value | Count | Frequency (%) |
| 50226 | 1 | |
| 50225 | 1 | |
| 50224 | 1 | |
| 50223 | 1 | |
| 50222 | 1 | |
| 50221 | 1 | |
| 50220 | 1 | |
| 50219 | 1 | |
| 50212 | 1 | |
| 50210 | 1 |
buildingtype
Categorical
HIGH CORRELATION 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.2 KiB |
| NonResidential | |
|---|---|
| Multifamily LR (1-4) | |
| Multifamily MR (5-9) | |
| Multifamily HR (10+) | 109 |
| SPS-District K-12 | 97 |
| Other values (3) | 109 |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 17.165567 |
| Min length | 6 |
Characters and Unicode
| Total characters | 57230 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NonResidential |
|---|---|
| 2nd row | NonResidential |
| 3rd row | NonResidential |
| 4th row | NonResidential |
| 5th row | NonResidential |
Common Values
| Value | Count | Frequency (%) |
| NonResidential | 1442 | |
| Multifamily LR (1-4) | 999 | |
| Multifamily MR (5-9) | 578 | |
| Multifamily HR (10+) | 109 | 3.3% |
| SPS-District K-12 | 97 | 2.9% |
| Nonresidential COS | 84 | 2.5% |
| Campus | 24 | 0.7% |
| Nonresidential WA | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| multifamily | 1686 | |
| nonresidential | 1527 | |
| lr | 999 | |
| 1-4 | 999 | |
| mr | 578 | 8.4% |
| 5-9 | 578 | 8.4% |
| hr | 109 | 1.6% |
| 10 | 109 | 1.6% |
| sps-district | 97 | 1.4% |
| k-12 | 97 | 1.4% |
| Other values (3) | 109 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 6620 | 11.6% |
| l | 4899 | 8.6% |
| 3554 | 6.2% | |
| t | 3407 | 6.0% |
| a | 3237 | 5.7% |
| R | 3128 | 5.5% |
| n | 3054 | 5.3% |
| e | 3054 | 5.3% |
| M | 2264 | 4.0% |
| - | 1771 | 3.1% |
| Other values (30) | 22242 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36068 | |
| Uppercase Letter | 8790 | 15.4% |
| Decimal Number | 3566 | 6.2% |
| Space Separator | 3554 | 6.2% |
| Dash Punctuation | 1771 | 3.1% |
| Open Punctuation | 1686 | 2.9% |
| Close Punctuation | 1686 | 2.9% |
| Math Symbol | 109 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 6620 | |
| l | 4899 | |
| t | 3407 | |
| a | 3237 | |
| n | 3054 | |
| e | 3054 | |
| u | 1710 | 4.7% |
| m | 1710 | 4.7% |
| f | 1686 | 4.7% |
| y | 1686 | 4.7% |
| Other values (6) | 5005 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 3128 | |
| M | 2264 | |
| N | 1527 | |
| L | 999 | 11.4% |
| S | 278 | 3.2% |
| H | 109 | 1.2% |
| C | 108 | 1.2% |
| K | 97 | 1.1% |
| P | 97 | 1.1% |
| D | 97 | 1.1% |
| Other values (3) | 86 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1205 | |
| 4 | 999 | |
| 5 | 578 | |
| 9 | 578 | |
| 0 | 109 | 3.1% |
| 2 | 97 | 2.7% |
Space Separator
| Value | Count | Frequency (%) |
| 3554 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1771 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1686 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1686 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 109 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44858 | |
| Common | 12372 | 21.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 6620 | |
| l | 4899 | |
| t | 3407 | 7.6% |
| a | 3237 | 7.2% |
| R | 3128 | 7.0% |
| n | 3054 | 6.8% |
| e | 3054 | 6.8% |
| M | 2264 | 5.0% |
| u | 1710 | 3.8% |
| m | 1710 | 3.8% |
| Other values (19) | 11775 |
Common
| Value | Count | Frequency (%) |
| 3554 | ||
| - | 1771 | |
| ( | 1686 | |
| ) | 1686 | |
| 1 | 1205 | 9.7% |
| 4 | 999 | 8.1% |
| 5 | 578 | 4.7% |
| 9 | 578 | 4.7% |
| 0 | 109 | 0.9% |
| + | 109 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57230 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 6620 | 11.6% |
| l | 4899 | 8.6% |
| 3554 | 6.2% | |
| t | 3407 | 6.0% |
| a | 3237 | 5.7% |
| R | 3128 | 5.5% |
| n | 3054 | 5.3% |
| e | 3054 | 5.3% |
| M | 2264 | 4.0% |
| - | 1771 | 3.1% |
| Other values (30) | 22242 |
primarypropertytype
Categorical
HIGH CORRELATION 
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.2 KiB |
| Low-Rise Multifamily | |
|---|---|
| Mid-Rise Multifamily | |
| Small- and Mid-Sized Office | |
| Other | |
| Warehouse | |
| Other values (19) |
Length
| Max length | 27 |
|---|---|
| Median length | 22 |
| Mean length | 17.181464 |
| Min length | 5 |
Characters and Unicode
| Total characters | 57283 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hotel |
|---|---|
| 2nd row | Hotel |
| 3rd row | Hotel |
| 4th row | Hotel |
| 5th row | Hotel |
Common Values
| Value | Count | Frequency (%) |
| Low-Rise Multifamily | 968 | |
| Mid-Rise Multifamily | 561 | |
| Small- and Mid-Sized Office | 288 | 8.6% |
| Other | 253 | 7.6% |
| Warehouse | 187 | 5.6% |
| Large Office | 170 | 5.1% |
| K-12 School | 137 | 4.1% |
| Mixed Use Property | 132 | 4.0% |
| High-Rise Multifamily | 104 | 3.1% |
| Retail Store | 89 | 2.7% |
| Other values (14) | 445 |
Length
| Value | Count | Frequency (%) |
| multifamily | 1633 | |
| low-rise | 968 | |
| mid-rise | 561 | 8.1% |
| office | 500 | 7.2% |
| small | 288 | 4.2% |
| and | 288 | 4.2% |
| mid-sized | 288 | 4.2% |
| other | 253 | 3.7% |
| warehouse | 199 | 2.9% |
| large | 170 | 2.5% |
| Other values (28) | 1777 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 7507 | 13.1% |
| e | 4485 | 7.8% |
| l | 4364 | 7.6% |
| 3591 | 6.3% | |
| a | 3005 | 5.2% |
| t | 2762 | 4.8% |
| f | 2673 | 4.7% |
| M | 2653 | 4.6% |
| - | 2374 | 4.1% |
| s | 2156 | 3.8% |
| Other values (33) | 21713 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 42458 | |
| Uppercase Letter | 8546 | 14.9% |
| Space Separator | 3591 | 6.3% |
| Dash Punctuation | 2374 | 4.1% |
| Decimal Number | 274 | 0.5% |
| Other Punctuation | 40 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 7507 | |
| e | 4485 | |
| l | 4364 | |
| a | 3005 | 7.1% |
| t | 2762 | 6.5% |
| f | 2673 | 6.3% |
| s | 2156 | 5.1% |
| o | 2088 | 4.9% |
| m | 2051 | 4.8% |
| u | 1982 | 4.7% |
| Other values (14) | 9385 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2653 | |
| R | 1769 | |
| L | 1148 | |
| S | 983 | 11.5% |
| O | 753 | 8.8% |
| W | 268 | 3.1% |
| H | 213 | 2.5% |
| U | 157 | 1.8% |
| C | 143 | 1.7% |
| K | 137 | 1.6% |
| Other values (4) | 322 | 3.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 137 | |
| 2 | 137 |
Space Separator
| Value | Count | Frequency (%) |
| 3591 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2374 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 40 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 51004 | |
| Common | 6279 | 11.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 7507 | |
| e | 4485 | 8.8% |
| l | 4364 | 8.6% |
| a | 3005 | 5.9% |
| t | 2762 | 5.4% |
| f | 2673 | 5.2% |
| M | 2653 | 5.2% |
| s | 2156 | 4.2% |
| o | 2088 | 4.1% |
| m | 2051 | 4.0% |
| Other values (28) | 17260 |
Common
| Value | Count | Frequency (%) |
| 3591 | ||
| - | 2374 | |
| 1 | 137 | 2.2% |
| 2 | 137 | 2.2% |
| / | 40 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57283 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 7507 | 13.1% |
| e | 4485 | 7.8% |
| l | 4364 | 7.6% |
| 3591 | 6.3% | |
| a | 3005 | 5.2% |
| t | 2762 | 4.8% |
| f | 2673 | 4.7% |
| M | 2653 | 4.6% |
| - | 2374 | 4.1% |
| s | 2156 | 3.8% |
| Other values (33) | 21713 |
| Distinct | 3228 |
|---|---|
| Distinct (%) | 96.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.2 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 10 |
| Mean length | 10.005099 |
| Min length | 9 |
Characters and Unicode
| Total characters | 33357 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3152 ? |
|---|---|
| Unique (%) | 94.5% |
Sample
| 1st row | 0659000030 |
|---|---|
| 2nd row | 0659000220 |
| 3rd row | 0659000475 |
| 4th row | 0659000640 |
| 5th row | 0659000970 |
| Value | Count | Frequency (%) |
| 1625049001 | 8 | 0.2% |
| 3224049012 | 5 | 0.1% |
| 0925049346 | 5 | 0.1% |
| 0002400002 | 5 | 0.1% |
| 7666203240 | 4 | 0.1% |
| 3624039009 | 4 | 0.1% |
| 8809700040 | 3 | 0.1% |
| 1985200003 | 3 | 0.1% |
| 5036300605 | 3 | 0.1% |
| 8632880000 | 3 | 0.1% |
| Other values (3219) | 3293 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 11207 | |
| 2 | 3127 | 9.4% |
| 5 | 2906 | 8.7% |
| 6 | 2683 | 8.0% |
| 1 | 2662 | 8.0% |
| 9 | 2359 | 7.1% |
| 7 | 2336 | 7.0% |
| 4 | 2143 | 6.4% |
| 3 | 2041 | 6.1% |
| 8 | 1886 | 5.7% |
| Other values (5) | 7 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 33350 | |
| Lowercase Letter | 3 | < 0.1% |
| Space Separator | 2 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 11207 | |
| 2 | 3127 | 9.4% |
| 5 | 2906 | 8.7% |
| 6 | 2683 | 8.0% |
| 1 | 2662 | 8.0% |
| 9 | 2359 | 7.1% |
| 7 | 2336 | 7.0% |
| 4 | 2143 | 6.4% |
| 3 | 2041 | 6.1% |
| 8 | 1886 | 5.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1 | |
| n | 1 | |
| d | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 33354 | |
| Latin | 3 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 11207 | |
| 2 | 3127 | 9.4% |
| 5 | 2906 | 8.7% |
| 6 | 2683 | 8.0% |
| 1 | 2662 | 8.0% |
| 9 | 2359 | 7.1% |
| 7 | 2336 | 7.0% |
| 4 | 2143 | 6.4% |
| 3 | 2041 | 6.1% |
| 8 | 1886 | 5.7% |
| Other values (2) | 4 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| a | 1 | |
| n | 1 | |
| d | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33357 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 11207 | |
| 2 | 3127 | 9.4% |
| 5 | 2906 | 8.7% |
| 6 | 2683 | 8.0% |
| 1 | 2662 | 8.0% |
| 9 | 2359 | 7.1% |
| 7 | 2336 | 7.0% |
| 4 | 2143 | 6.4% |
| 3 | 2041 | 6.1% |
| 8 | 1886 | 5.7% |
| Other values (5) | 7 | < 0.1% |
councildistrictcode
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4463107 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 7 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.1177383 |
|---|---|
| Coefficient of variation (CV) | 0.47629111 |
| Kurtosis | -1.4439402 |
| Mean | 4.4463107 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.073983963 |
| Sum | 14824 |
| Variance | 4.4848154 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 1025 | |
| 3 | 588 | |
| 2 | 503 | |
| 4 | 361 | 10.8% |
| 5 | 337 | 10.1% |
| 1 | 274 | 8.2% |
| 6 | 246 | 7.4% |
| Value | Count | Frequency (%) |
| 1 | 274 | 8.2% |
| 2 | 503 | |
| 3 | 588 | |
| 4 | 361 | 10.8% |
| 5 | 337 | 10.1% |
| 6 | 246 | 7.4% |
| 7 | 1025 |
| Value | Count | Frequency (%) |
| 7 | 1025 | |
| 6 | 246 | 7.4% |
| 5 | 337 | 10.1% |
| 4 | 361 | 10.8% |
| 3 | 588 | |
| 2 | 503 | |
| 1 | 274 | 8.2% |
neighborhood
Categorical
HIGH CORRELATION 
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.2 KiB |
| DOWNTOWN | |
|---|---|
| EAST | |
| MAGNOLIA / QUEEN ANNE | |
| GREATER DUWAMISH | |
| NORTHEAST | |
| Other values (8) |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 10.110678 |
| Min length | 4 |
Characters and Unicode
| Total characters | 33709 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DOWNTOWN |
|---|---|
| 2nd row | DOWNTOWN |
| 3rd row | DOWNTOWN |
| 4th row | DOWNTOWN |
| 5th row | DOWNTOWN |
Common Values
| Value | Count | Frequency (%) |
| DOWNTOWN | 564 | |
| EAST | 448 | |
| MAGNOLIA / QUEEN ANNE | 418 | |
| GREATER DUWAMISH | 371 | |
| NORTHEAST | 274 | |
| LAKE UNION | 251 | |
| NORTHWEST | 220 | 6.6% |
| NORTH | 186 | 5.6% |
| SOUTHWEST | 158 | 4.7% |
| BALLARD | 133 | 4.0% |
| Other values (3) | 311 |
Length
| Value | Count | Frequency (%) |
| downtown | 564 | |
| east | 448 | 8.6% |
| magnolia | 418 | 8.0% |
| 418 | 8.0% | |
| queen | 418 | 8.0% |
| anne | 418 | 8.0% |
| greater | 371 | 7.1% |
| duwamish | 371 | 7.1% |
| northeast | 274 | 5.3% |
| union | 251 | 4.8% |
| Other values (8) | 1259 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 4113 | |
| E | 3743 | |
| A | 3461 | |
| T | 3194 | |
| O | 2730 | 8.1% |
| W | 1877 | 5.6% |
| 1876 | 5.6% | |
| S | 1819 | 5.4% |
| R | 1771 | 5.3% |
| H | 1304 | 3.9% |
| Other values (11) | 7821 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 31415 | |
| Space Separator | 1876 | 5.6% |
| Other Punctuation | 418 | 1.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 4113 | |
| E | 3743 | |
| A | 3461 | |
| T | 3194 | |
| O | 2730 | |
| W | 1877 | 6.0% |
| S | 1819 | 5.8% |
| R | 1771 | 5.6% |
| H | 1304 | 4.2% |
| U | 1293 | 4.1% |
| Other values (9) | 6110 |
Space Separator
| Value | Count | Frequency (%) |
| 1876 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 418 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31415 | |
| Common | 2294 | 6.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 4113 | |
| E | 3743 | |
| A | 3461 | |
| T | 3194 | |
| O | 2730 | |
| W | 1877 | 6.0% |
| S | 1819 | 5.8% |
| R | 1771 | 5.6% |
| H | 1304 | 4.2% |
| U | 1293 | 4.1% |
| Other values (9) | 6110 |
Common
| Value | Count | Frequency (%) |
| 1876 | ||
| / | 418 | 18.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33709 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 4113 | |
| E | 3743 | |
| A | 3461 | |
| T | 3194 | |
| O | 2730 | 8.1% |
| W | 1877 | 5.6% |
| 1876 | 5.6% | |
| S | 1819 | 5.4% |
| R | 1771 | 5.3% |
| H | 1304 | 3.9% |
| Other values (11) | 7821 |
numberofbuildings
Real number (ℝ)
SKEWED 
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1340732 |
| Minimum | 1 |
|---|---|
| Maximum | 111 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 111 |
| Range | 110 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.1098734 |
|---|---|
| Coefficient of variation (CV) | 1.8604385 |
| Kurtosis | 2219.3838 |
| Mean | 1.1340732 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 43.709661 |
| Sum | 3781 |
| Variance | 4.4515659 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3235 | |
| 2 | 36 | 1.1% |
| 3 | 22 | 0.7% |
| 4 | 12 | 0.4% |
| 5 | 9 | 0.3% |
| 6 | 5 | 0.1% |
| 8 | 3 | 0.1% |
| 14 | 2 | 0.1% |
| 9 | 2 | 0.1% |
| 10 | 2 | 0.1% |
| Other values (6) | 6 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 3235 | |
| 2 | 36 | 1.1% |
| 3 | 22 | 0.7% |
| 4 | 12 | 0.4% |
| 5 | 9 | 0.3% |
| 6 | 5 | 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 3 | 0.1% |
| 9 | 2 | 0.1% |
| 10 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 111 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 14 | 2 | |
| 11 | 1 | < 0.1% |
| 10 | 2 | |
| 9 | 2 | |
| 8 | 3 | |
| 7 | 1 | < 0.1% |
numberoffloors
Real number (ℝ)
| Distinct | 49 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.7228554 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 12 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 5.5107542 |
|---|---|
| Coefficient of variation (CV) | 1.1668268 |
| Kurtosis | 55.885782 |
| Mean | 4.7228554 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 5.9257374 |
| Sum | 15746 |
| Variance | 30.368412 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 681 | |
| 4 | 678 | |
| 1 | 478 | |
| 2 | 433 | |
| 6 | 304 | |
| 5 | 294 | |
| 7 | 146 | 4.4% |
| 8 | 63 | 1.9% |
| 10 | 32 | 1.0% |
| 11 | 32 | 1.0% |
| Other values (39) | 193 | 5.8% |
| Value | Count | Frequency (%) |
| 1 | 478 | |
| 2 | 433 | |
| 3 | 681 | |
| 4 | 678 | |
| 5 | 294 | |
| 6 | 304 | |
| 7 | 146 | 4.4% |
| 8 | 63 | 1.9% |
| 9 | 18 | 0.5% |
| 10 | 32 | 1.0% |
| Value | Count | Frequency (%) |
| 99 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
| 63 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 55 | 1 | < 0.1% |
| 49 | 1 | < 0.1% |
| 47 | 1 | < 0.1% |
| 46 | 1 | < 0.1% |
| 42 | 6 | |
| 41 | 3 |
propertygfatotal
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 3159 |
|---|---|
| Distinct (%) | 94.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 95408.128 |
| Minimum | 11285 |
|---|---|
| Maximum | 9320156 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 11285 |
|---|---|
| 5-th percentile | 21301.95 |
| Q1 | 28534.75 |
| median | 44454 |
| Q3 | 91550 |
| 95-th percentile | 320811.45 |
| Maximum | 9320156 |
| Range | 9308871 |
| Interquartile range (IQR) | 63015.25 |
Descriptive statistics
| Standard deviation | 220109.49 |
|---|---|
| Coefficient of variation (CV) | 2.3070307 |
| Kurtosis | 935.94605 |
| Mean | 95408.128 |
| Median Absolute Deviation (MAD) | 20003.5 |
| Skewness | 24.004904 |
| Sum | 3.180907 × 108 |
| Variance | 4.8448185 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 36000 | 9 | 0.3% |
| 25920 | 8 | 0.2% |
| 21600 | 7 | 0.2% |
| 28800 | 7 | 0.2% |
| 24000 | 6 | 0.2% |
| 22320 | 4 | 0.1% |
| 30240 | 4 | 0.1% |
| 30720 | 4 | 0.1% |
| 20000 | 3 | 0.1% |
| 43380 | 3 | 0.1% |
| Other values (3149) | 3279 |
| Value | Count | Frequency (%) |
| 11285 | 1 | |
| 11685 | 1 | |
| 11968 | 1 | |
| 12294 | 1 | |
| 12769 | 1 | |
| 13157 | 1 | |
| 13661 | 1 | |
| 14101 | 1 | |
| 15398 | 1 | |
| 16000 | 1 |
| Value | Count | Frequency (%) |
| 9320156 | 1 | |
| 2200000 | 1 | |
| 1952220 | 1 | |
| 1765970 | 1 | |
| 1605578 | 1 | |
| 1592914 | 1 | |
| 1585960 | 1 | |
| 1536606 | 1 | |
| 1400000 | 2 | |
| 1380959 | 1 |
propertygfaparking
Real number (ℝ)
ZEROS 
| Distinct | 491 |
|---|---|
| Distinct (%) | 14.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8072.251 |
| Minimum | 0 |
|---|---|
| Maximum | 512608 |
| Zeros | 2835 |
| Zeros (%) | 85.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 47839.05 |
| Maximum | 512608 |
| Range | 512608 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 32511.558 |
|---|---|
| Coefficient of variation (CV) | 4.0275702 |
| Kurtosis | 58.284078 |
| Mean | 8072.251 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.6140599 |
| Sum | 26912885 |
| Variance | 1.0570014 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2835 | |
| 13320 | 3 | 0.1% |
| 12960 | 2 | 0.1% |
| 10800 | 2 | 0.1% |
| 100176 | 2 | 0.1% |
| 22000 | 2 | 0.1% |
| 30000 | 2 | 0.1% |
| 25800 | 2 | 0.1% |
| 20416 | 2 | 0.1% |
| 3029 | 1 | < 0.1% |
| Other values (481) | 481 | 14.4% |
| Value | Count | Frequency (%) |
| 0 | 2835 | |
| 38 | 1 | < 0.1% |
| 260 | 1 | < 0.1% |
| 415 | 1 | < 0.1% |
| 604 | 1 | < 0.1% |
| 756 | 1 | < 0.1% |
| 800 | 1 | < 0.1% |
| 919 | 1 | < 0.1% |
| 1263 | 1 | < 0.1% |
| 1392 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 512608 | 1 | |
| 407795 | 1 | |
| 389860 | 1 | |
| 368980 | 1 | |
| 335109 | 1 | |
| 327680 | 1 | |
| 319400 | 1 | |
| 303707 | 1 | |
| 285688 | 1 | |
| 285000 | 1 |
propertygfabuilding_s
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 3157 |
|---|---|
| Distinct (%) | 94.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 87335.877 |
| Minimum | 3636 |
|---|---|
| Maximum | 9320156 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 3636 |
|---|---|
| 5-th percentile | 21038.6 |
| Q1 | 27794 |
| median | 43355 |
| Q3 | 84698.5 |
| 95-th percentile | 283450.4 |
| Maximum | 9320156 |
| Range | 9316520 |
| Interquartile range (IQR) | 56904.5 |
Descriptive statistics
| Standard deviation | 209159.59 |
|---|---|
| Coefficient of variation (CV) | 2.3948874 |
| Kurtosis | 1148.5293 |
| Mean | 87335.877 |
| Median Absolute Deviation (MAD) | 19067 |
| Skewness | 27.479087 |
| Sum | 2.9117782 × 108 |
| Variance | 4.3747733 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 36000 | 9 | 0.3% |
| 25920 | 8 | 0.2% |
| 21600 | 7 | 0.2% |
| 28800 | 7 | 0.2% |
| 24000 | 6 | 0.2% |
| 30240 | 4 | 0.1% |
| 30720 | 4 | 0.1% |
| 22320 | 4 | 0.1% |
| 24288 | 3 | 0.1% |
| 25380 | 3 | 0.1% |
| Other values (3147) | 3279 |
| Value | Count | Frequency (%) |
| 3636 | 1 | |
| 10925 | 1 | |
| 11285 | 1 | |
| 11440 | 1 | |
| 11685 | 1 | |
| 11968 | 1 | |
| 12294 | 1 | |
| 12769 | 1 | |
| 12806 | 1 | |
| 13157 | 1 |
| Value | Count | Frequency (%) |
| 9320156 | 1 | |
| 2200000 | 1 | |
| 1765970 | 1 | |
| 1632820 | 1 | |
| 1592914 | 1 | |
| 1400000 | 1 | |
| 1380959 | 1 | |
| 1323055 | 1 | |
| 1258280 | 1 | |
| 1215718 | 1 |
| Distinct | 463 |
|---|---|
| Distinct (%) | 13.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.2 KiB |
Length
| Max length | 255 |
|---|---|
| Median length | 162 |
| Mean length | 25.959808 |
| Min length | 5 |
Characters and Unicode
| Total characters | 86550 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 311 ? |
|---|---|
| Unique (%) | 9.3% |
Sample
| 1st row | Hotel |
|---|---|
| 2nd row | Hotel, Parking, Restaurant |
| 3rd row | Hotel |
| 4th row | Hotel |
| 5th row | Hotel, Parking, Swimming Pool |
| Value | Count | Frequency (%) |
| multifamily | 1691 | |
| housing | 1691 | |
| parking | 1079 | |
| office | 951 | 9.7% |
| store | 467 | 4.8% |
| other | 415 | 4.2% |
| retail | 399 | 4.1% |
| warehouse | 277 | 2.8% |
| non-refrigerated | 260 | 2.7% |
| 180 | 1.8% | |
| Other values (97) | 2401 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 9282 | 10.7% |
| 6477 | 7.5% | |
| e | 5641 | 6.5% |
| a | 5202 | 6.0% |
| l | 4908 | 5.7% |
| t | 4898 | 5.7% |
| u | 4223 | 4.9% |
| r | 4215 | 4.9% |
| n | 4142 | 4.8% |
| o | 3977 | 4.6% |
| Other values (42) | 33585 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 65782 | |
| Uppercase Letter | 10264 | 11.9% |
| Space Separator | 6477 | 7.5% |
| Other Punctuation | 3032 | 3.5% |
| Dash Punctuation | 629 | 0.7% |
| Decimal Number | 290 | 0.3% |
| Close Punctuation | 38 | < 0.1% |
| Open Punctuation | 38 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 9282 | |
| e | 5641 | 8.6% |
| a | 5202 | 7.9% |
| l | 4908 | 7.5% |
| t | 4898 | 7.4% |
| u | 4223 | 6.4% |
| r | 4215 | 6.4% |
| n | 4142 | 6.3% |
| o | 3977 | 6.0% |
| f | 3936 | 6.0% |
| Other values (12) | 15358 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 1891 | |
| M | 1840 | |
| O | 1388 | |
| P | 1227 | |
| S | 1022 | |
| R | 954 | |
| W | 353 | 3.4% |
| C | 340 | 3.3% |
| N | 266 | 2.6% |
| F | 221 | 2.2% |
| Other values (11) | 762 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2659 | |
| / | 361 | 11.9% |
| & | 12 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 145 | |
| 1 | 145 |
Space Separator
| Value | Count | Frequency (%) |
| 6477 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 629 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 38 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 38 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 76046 | |
| Common | 10504 | 12.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 9282 | 12.2% |
| e | 5641 | 7.4% |
| a | 5202 | 6.8% |
| l | 4908 | 6.5% |
| t | 4898 | 6.4% |
| u | 4223 | 5.6% |
| r | 4215 | 5.5% |
| n | 4142 | 5.4% |
| o | 3977 | 5.2% |
| f | 3936 | 5.2% |
| Other values (33) | 25622 |
Common
| Value | Count | Frequency (%) |
| 6477 | ||
| , | 2659 | |
| - | 629 | 6.0% |
| / | 361 | 3.4% |
| 2 | 145 | 1.4% |
| 1 | 145 | 1.4% |
| ) | 38 | 0.4% |
| ( | 38 | 0.4% |
| & | 12 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 86550 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 9282 | 10.7% |
| 6477 | 7.5% | |
| e | 5641 | 6.5% |
| a | 5202 | 6.0% |
| l | 4908 | 5.7% |
| t | 4898 | 5.7% |
| u | 4223 | 4.9% |
| r | 4215 | 4.9% |
| n | 4142 | 4.8% |
| o | 3977 | 4.6% |
| Other values (42) | 33585 |
| Distinct | 56 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 11 |
| Missing (%) | 0.3% |
| Memory size | 26.2 KiB |
Length
| Max length | 52 |
|---|---|
| Median length | 19 |
| Mean length | 16.28378 |
| Min length | 5 |
Characters and Unicode
| Total characters | 54111 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Hotel |
|---|---|
| 2nd row | Hotel |
| 3rd row | Hotel |
| 4th row | Hotel |
| 5th row | Hotel |
| Value | Count | Frequency (%) |
| multifamily | 1651 | |
| housing | 1651 | |
| office | 536 | 8.8% |
| warehouse | 211 | 3.5% |
| non-refrigerated | 199 | 3.3% |
| other | 176 | 2.9% |
| store | 138 | 2.3% |
| k-12 | 137 | 2.2% |
| school | 137 | 2.2% |
| facility | 98 | 1.6% |
| Other values (79) | 1163 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 6762 | 12.5% |
| l | 4091 | 7.6% |
| u | 3762 | 7.0% |
| t | 3069 | 5.7% |
| o | 3046 | 5.6% |
| e | 3016 | 5.6% |
| f | 2975 | 5.5% |
| a | 2780 | 5.1% |
| 2774 | 5.1% | |
| n | 2375 | 4.4% |
| Other values (41) | 19461 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 43994 | |
| Uppercase Letter | 6396 | 11.8% |
| Space Separator | 2774 | 5.1% |
| Dash Punctuation | 443 | 0.8% |
| Decimal Number | 274 | 0.5% |
| Other Punctuation | 196 | 0.4% |
| Open Punctuation | 17 | < 0.1% |
| Close Punctuation | 17 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 6762 | |
| l | 4091 | |
| u | 3762 | 8.6% |
| t | 3069 | 7.0% |
| o | 3046 | 6.9% |
| e | 3016 | 6.9% |
| f | 2975 | 6.8% |
| a | 2780 | 6.3% |
| n | 2375 | 5.4% |
| s | 2172 | 4.9% |
| Other values (11) | 9946 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 1781 | |
| M | 1736 | |
| O | 724 | |
| S | 472 | 7.4% |
| R | 390 | 6.1% |
| W | 281 | 4.4% |
| C | 200 | 3.1% |
| N | 199 | 3.1% |
| K | 137 | 2.1% |
| F | 109 | 1.7% |
| Other values (11) | 367 | 5.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 166 | |
| , | 20 | 10.2% |
| & | 10 | 5.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 137 | |
| 1 | 137 |
Space Separator
| Value | Count | Frequency (%) |
| 2774 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 443 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 17 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 17 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50390 | |
| Common | 3721 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 6762 | |
| l | 4091 | 8.1% |
| u | 3762 | 7.5% |
| t | 3069 | 6.1% |
| o | 3046 | 6.0% |
| e | 3016 | 6.0% |
| f | 2975 | 5.9% |
| a | 2780 | 5.5% |
| n | 2375 | 4.7% |
| s | 2172 | 4.3% |
| Other values (32) | 16342 |
Common
| Value | Count | Frequency (%) |
| 2774 | ||
| - | 443 | 11.9% |
| / | 166 | 4.5% |
| 2 | 137 | 3.7% |
| 1 | 137 | 3.7% |
| , | 20 | 0.5% |
| ( | 17 | 0.5% |
| ) | 17 | 0.5% |
| & | 10 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 54111 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 6762 | 12.5% |
| l | 4091 | 7.6% |
| u | 3762 | 7.0% |
| t | 3069 | 5.7% |
| o | 3046 | 5.6% |
| e | 3016 | 5.6% |
| f | 2975 | 5.5% |
| a | 2780 | 5.1% |
| 2774 | 5.1% | |
| n | 2375 | 4.4% |
| Other values (41) | 19461 |
largestpropertyusetypegfa
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 3096 |
|---|---|
| Distinct (%) | 93.2% |
| Missing | 11 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79499.113 |
| Minimum | 5656 |
|---|---|
| Maximum | 9320156 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 5656 |
|---|---|
| 5-th percentile | 17528.6 |
| Q1 | 25148.5 |
| median | 39960 |
| Q3 | 76902.5 |
| 95-th percentile | 244975.2 |
| Maximum | 9320156 |
| Range | 9314500 |
| Interquartile range (IQR) | 51754 |
Descriptive statistics
| Standard deviation | 202640.47 |
|---|---|
| Coefficient of variation (CV) | 2.5489652 |
| Kurtosis | 1309.0358 |
| Mean | 79499.113 |
| Median Absolute Deviation (MAD) | 17619 |
| Skewness | 29.97068 |
| Sum | 2.6417555 × 108 |
| Variance | 4.1063162 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24000 | 9 | 0.3% |
| 22000 | 8 | 0.2% |
| 30000 | 8 | 0.2% |
| 21600 | 7 | 0.2% |
| 20000 | 7 | 0.2% |
| 15000 | 5 | 0.1% |
| 45000 | 5 | 0.1% |
| 24288 | 5 | 0.1% |
| 36000 | 5 | 0.1% |
| 28800 | 5 | 0.1% |
| Other values (3086) | 3259 | |
| (Missing) | 11 | 0.3% |
| Value | Count | Frequency (%) |
| 5656 | 1 | |
| 6455 | 1 | |
| 6601 | 1 | |
| 6900 | 1 | |
| 7245 | 1 | |
| 7387 | 1 | |
| 7501 | 1 | |
| 7583 | 1 | |
| 7758 | 1 | |
| 8061 | 1 |
| Value | Count | Frequency (%) |
| 9320156 | 1 | |
| 1719643 | 1 | |
| 1680937 | 1 | |
| 1639334 | 1 | |
| 1585960 | 1 | |
| 1350182 | 1 | |
| 1314475 | 1 | |
| 1191115 | 1 | |
| 1172127 | 1 | |
| 1072000 | 1 |
energystarscore
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 100 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 825 |
| Missing (%) | 24.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 67.81666 |
| Minimum | 1 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 53 |
| median | 75 |
| Q3 | 90 |
| 95-th percentile | 99 |
| Maximum | 100 |
| Range | 99 |
| Interquartile range (IQR) | 37 |
Descriptive statistics
| Standard deviation | 26.705492 |
|---|---|
| Coefficient of variation (CV) | 0.39378954 |
| Kurtosis | -0.22156724 |
| Mean | 67.81666 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | -0.85580171 |
| Sum | 170152 |
| Variance | 713.18328 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 92 | 2.8% |
| 98 | 72 | 2.2% |
| 96 | 64 | 1.9% |
| 89 | 58 | 1.7% |
| 93 | 57 | 1.7% |
| 92 | 53 | 1.6% |
| 95 | 51 | 1.5% |
| 94 | 49 | 1.5% |
| 91 | 49 | 1.5% |
| 99 | 48 | 1.4% |
| Other values (90) | 1916 | |
| (Missing) | 825 |
| Value | Count | Frequency (%) |
| 1 | 33 | |
| 2 | 10 | 0.3% |
| 3 | 13 | 0.4% |
| 4 | 5 | 0.1% |
| 5 | 8 | 0.2% |
| 6 | 8 | 0.2% |
| 7 | 10 | 0.3% |
| 8 | 10 | 0.3% |
| 9 | 5 | 0.1% |
| 10 | 10 | 0.3% |
| Value | Count | Frequency (%) |
| 100 | 92 | |
| 99 | 48 | |
| 98 | 72 | |
| 97 | 48 | |
| 96 | 64 | |
| 95 | 51 | |
| 94 | 49 | |
| 93 | 57 | |
| 92 | 53 | |
| 91 | 49 |
siteeui_kbtu_sf
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1066 |
|---|---|
| Distinct (%) | 32.0% |
| Missing | 2 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.763475 |
| Minimum | 0 |
|---|---|
| Maximum | 834.40002 |
| Zeros | 16 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 18.4 |
| Q1 | 28.1 |
| median | 38.799999 |
| Q3 | 60.400002 |
| 95-th percentile | 144.5 |
| Maximum | 834.40002 |
| Range | 834.40002 |
| Interquartile range (IQR) | 32.300001 |
Descriptive statistics
| Standard deviation | 55.938665 |
|---|---|
| Coefficient of variation (CV) | 1.0214594 |
| Kurtosis | 41.236394 |
| Mean | 54.763475 |
| Median Absolute Deviation (MAD) | 13.400002 |
| Skewness | 5.0651096 |
| Sum | 182471.9 |
| Variance | 3129.1343 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24.70000076 | 17 | 0.5% |
| 28.79999924 | 17 | 0.5% |
| 24.20000076 | 16 | 0.5% |
| 0 | 16 | 0.5% |
| 32 | 15 | 0.4% |
| 28.89999962 | 14 | 0.4% |
| 31.70000076 | 14 | 0.4% |
| 26.39999962 | 14 | 0.4% |
| 26.60000038 | 13 | 0.4% |
| 22.79999924 | 13 | 0.4% |
| Other values (1056) | 3183 |
| Value | Count | Frequency (%) |
| 0 | 16 | |
| 1.399999976 | 1 | < 0.1% |
| 2.099999905 | 1 | < 0.1% |
| 2.299999952 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 3.200000048 | 1 | < 0.1% |
| 3.5 | 2 | 0.1% |
| 3.599999905 | 2 | 0.1% |
| 3.799999952 | 1 | < 0.1% |
| 4.300000191 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 834.4000244 | 1 | |
| 707.2999878 | 1 | |
| 696.7000122 | 1 | |
| 694.7000122 | 1 | |
| 639.7000122 | 1 | |
| 593.5999756 | 1 | |
| 465.5 | 1 | |
| 456.6000061 | 1 | |
| 438.2000122 | 1 | |
| 412.7000122 | 1 |
siteeuiwn_kbtu_sf
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1085 |
|---|---|
| Distinct (%) | 32.6% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57.073987 |
| Minimum | 0 |
|---|---|
| Maximum | 834.40002 |
| Zeros | 29 |
| Zeros (%) | 0.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 18.700001 |
| Q1 | 29.5 |
| median | 41 |
| Q3 | 64.300003 |
| 95-th percentile | 147.7 |
| Maximum | 834.40002 |
| Range | 834.40002 |
| Interquartile range (IQR) | 34.800003 |
Descriptive statistics
| Standard deviation | 56.819416 |
|---|---|
| Coefficient of variation (CV) | 0.99553963 |
| Kurtosis | 38.822409 |
| Mean | 57.073987 |
| Median Absolute Deviation (MAD) | 14.200001 |
| Skewness | 4.9093207 |
| Sum | 190227.6 |
| Variance | 3228.446 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 29 | 0.9% |
| 29.5 | 17 | 0.5% |
| 30.79999924 | 15 | 0.4% |
| 29 | 14 | 0.4% |
| 31.60000038 | 14 | 0.4% |
| 27.89999962 | 14 | 0.4% |
| 32.20000076 | 14 | 0.4% |
| 30.20000076 | 14 | 0.4% |
| 33.59999847 | 13 | 0.4% |
| 28.10000038 | 13 | 0.4% |
| Other values (1075) | 3176 |
| Value | Count | Frequency (%) |
| 0 | 29 | |
| 1.5 | 1 | < 0.1% |
| 2.099999905 | 1 | < 0.1% |
| 2.299999952 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 3.200000048 | 1 | < 0.1% |
| 3.5 | 1 | < 0.1% |
| 3.599999905 | 2 | 0.1% |
| 4 | 1 | < 0.1% |
| 4.300000191 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 834.4000244 | 1 | |
| 707.2999878 | 1 | |
| 694.7000122 | 1 | |
| 693.0999756 | 1 | |
| 639.7999878 | 1 | |
| 593.5999756 | 1 | |
| 468.7000122 | 1 | |
| 467 | 1 | |
| 460.1000061 | 1 | |
| 426.6000061 | 1 |
sourceeui_kbtu_sf
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1623 |
|---|---|
| Distinct (%) | 48.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 134.22463 |
| Minimum | 0 |
|---|---|
| Maximum | 2620 |
| Zeros | 24 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 41.494999 |
| Q1 | 75 |
| median | 96.400002 |
| Q3 | 143.875 |
| 95-th percentile | 349.455 |
| Maximum | 2620 |
| Range | 2620 |
| Interquartile range (IQR) | 68.874996 |
Descriptive statistics
| Standard deviation | 137.78693 |
|---|---|
| Coefficient of variation (CV) | 1.0265399 |
| Kurtosis | 81.265223 |
| Mean | 134.22463 |
| Median Absolute Deviation (MAD) | 27.5 |
| Skewness | 6.7414881 |
| Sum | 447504.9 |
| Variance | 18985.239 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 24 | 0.7% |
| 83.69999695 | 9 | 0.3% |
| 68.09999847 | 9 | 0.3% |
| 73.09999847 | 8 | 0.2% |
| 78.59999847 | 8 | 0.2% |
| 69.69999695 | 8 | 0.2% |
| 90.5 | 8 | 0.2% |
| 95 | 8 | 0.2% |
| 94.09999847 | 8 | 0.2% |
| 87.69999695 | 8 | 0.2% |
| Other values (1613) | 3236 |
| Value | Count | Frequency (%) |
| 0 | 24 | |
| 4.5 | 1 | < 0.1% |
| 6.599999905 | 2 | 0.1% |
| 6.900000095 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 9.5 | 1 | < 0.1% |
| 9.899999619 | 1 | < 0.1% |
| 10.19999981 | 1 | < 0.1% |
| 11.10000038 | 1 | < 0.1% |
| 11.19999981 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2620 | 1 | |
| 2217.800049 | 1 | |
| 2181.300049 | 1 | |
| 2007.900024 | 1 | |
| 1527.300049 | 1 | |
| 1206.699951 | 1 | |
| 1150.300049 | 1 | |
| 1026.599976 | 1 | |
| 962.0999756 | 1 | |
| 912.7999878 | 1 |
sourceeuiwn_kbtu_sf
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1669 |
|---|---|
| Distinct (%) | 50.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 137.78359 |
| Minimum | 0 |
|---|---|
| Maximum | 2620 |
| Zeros | 36 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 41.665 |
| Q1 | 78.724998 |
| median | 101.3 |
| Q3 | 148.3 |
| 95-th percentile | 350.285 |
| Maximum | 2620 |
| Range | 2620 |
| Interquartile range (IQR) | 69.575005 |
Descriptive statistics
| Standard deviation | 137.55422 |
|---|---|
| Coefficient of variation (CV) | 0.99833529 |
| Kurtosis | 81.154874 |
| Mean | 137.78359 |
| Median Absolute Deviation (MAD) | 28.300003 |
| Skewness | 6.7209135 |
| Sum | 459370.5 |
| Variance | 18921.165 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 36 | 1.1% |
| 73.59999847 | 9 | 0.3% |
| 87.30000305 | 9 | 0.3% |
| 75.5 | 8 | 0.2% |
| 98.90000153 | 8 | 0.2% |
| 83.5 | 8 | 0.2% |
| 102.4000015 | 8 | 0.2% |
| 93.59999847 | 8 | 0.2% |
| 104.5999985 | 8 | 0.2% |
| 84.90000153 | 8 | 0.2% |
| Other values (1659) | 3224 |
| Value | Count | Frequency (%) |
| 0 | 36 | |
| 4.599999905 | 1 | < 0.1% |
| 6.599999905 | 1 | < 0.1% |
| 6.900000095 | 1 | < 0.1% |
| 7.400000095 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 9.5 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 10.30000019 | 1 | < 0.1% |
| 11.19999981 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2620 | 1 | |
| 2217.800049 | 1 | |
| 2181.300049 | 1 | |
| 2008 | 1 | |
| 1527.300049 | 1 | |
| 1195.099976 | 1 | |
| 1138.400024 | 1 | |
| 1001 | 1 | |
| 954 | 1 | |
| 919.2999878 | 1 |
siteenergyuse_kbtu
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 3317 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5421880.3 |
| Minimum | 0 |
|---|---|
| Maximum | 8.7392371 × 108 |
| Zeros | 18 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 507617.73 |
| Q1 | 936592.41 |
| median | 1812395.8 |
| Q3 | 4223663.9 |
| 95-th percentile | 18143745 |
| Maximum | 8.7392371 × 108 |
| Range | 8.7392371 × 108 |
| Interquartile range (IQR) | 3287071.5 |
Descriptive statistics
| Standard deviation | 21712336 |
|---|---|
| Coefficient of variation (CV) | 4.0045767 |
| Kurtosis | 851.90528 |
| Mean | 5421880.3 |
| Median Absolute Deviation (MAD) | 1071607.6 |
| Skewness | 24.762482 |
| Sum | 1.8076549 × 1010 |
| Variance | 4.7142552 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 18 | 0.5% |
| 7226362.5 | 1 | < 0.1% |
| 958242.875 | 1 | < 0.1% |
| 1206165.75 | 1 | < 0.1% |
| 1302192.875 | 1 | < 0.1% |
| 150167.7969 | 1 | < 0.1% |
| 1386445.375 | 1 | < 0.1% |
| 1331469.75 | 1 | < 0.1% |
| 421389.4063 | 1 | < 0.1% |
| 12213423 | 1 | < 0.1% |
| Other values (3307) | 3307 |
| Value | Count | Frequency (%) |
| 0 | 18 | |
| 57133.19922 | 1 | < 0.1% |
| 79711.79688 | 1 | < 0.1% |
| 90558.70313 | 1 | < 0.1% |
| 97690.39844 | 1 | < 0.1% |
| 106918 | 1 | < 0.1% |
| 111969.7031 | 1 | < 0.1% |
| 113130 | 1 | < 0.1% |
| 116486.6016 | 1 | < 0.1% |
| 117438.3984 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 873923712 | 1 | |
| 448385312 | 1 | |
| 293090784 | 1 | |
| 291614432 | 1 | |
| 274682208 | 1 | |
| 253832464 | 1 | |
| 163945984 | 1 | |
| 143423024 | 1 | |
| 131373880 | 1 | |
| 114648520 | 1 |
siteenergyusewn_kbtu
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 3304 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5292551.5 |
| Minimum | 0 |
|---|---|
| Maximum | 4.7161386 × 108 |
| Zeros | 29 |
| Zeros (%) | 0.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 527651.54 |
| Q1 | 986329.5 |
| median | 1916863.4 |
| Q3 | 4381679 |
| 95-th percentile | 18192559 |
| Maximum | 4.7161386 × 108 |
| Range | 4.7161386 × 108 |
| Interquartile range (IQR) | 3395349.5 |
Descriptive statistics
| Standard deviation | 16002488 |
|---|---|
| Coefficient of variation (CV) | 3.0235867 |
| Kurtosis | 332.78844 |
| Mean | 5292551.5 |
| Median Absolute Deviation (MAD) | 1128377.7 |
| Skewness | 15.247975 |
| Sum | 1.7640074 × 1010 |
| Variance | 2.5607963 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 29 | 0.9% |
| 2127889.25 | 2 | 0.1% |
| 6739209 | 1 | < 0.1% |
| 1024822.188 | 1 | < 0.1% |
| 1342448.875 | 1 | < 0.1% |
| 909471.875 | 1 | < 0.1% |
| 509741.1875 | 1 | < 0.1% |
| 1355995.25 | 1 | < 0.1% |
| 1439042.25 | 1 | < 0.1% |
| 150167.7969 | 1 | < 0.1% |
| Other values (3294) | 3294 |
| Value | Count | Frequency (%) |
| 0 | 29 | |
| 58114.19922 | 1 | < 0.1% |
| 79967.89844 | 1 | < 0.1% |
| 90558.70313 | 1 | < 0.1% |
| 98862.89844 | 1 | < 0.1% |
| 109471.7969 | 1 | < 0.1% |
| 116486.6016 | 1 | < 0.1% |
| 116642.5 | 1 | < 0.1% |
| 120610.5 | 1 | < 0.1% |
| 127374 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 471613856 | 1 | |
| 296671744 | 1 | |
| 295929888 | 1 | |
| 274725984 | 1 | |
| 257764208 | 1 | |
| 167207104 | 1 | |
| 147299056 | 1 | |
| 137106112 | 1 | |
| 123205560 | 1 | |
| 103985264 | 1 |
steamuse
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 KiB |
| False | |
|---|---|
| True | 129 |
| Value | Count | Frequency (%) |
| False | 3205 | |
| True | 129 | 3.9% |
electricity
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 KiB |
| True | |
|---|---|
| False | 14 |
| Value | Count | Frequency (%) |
| True | 3320 | |
| False | 14 | 0.4% |
naturalgas
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 2094 | |
| False | 1240 |
defaultdata
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 KiB |
| False | |
|---|---|
| True | 111 |
| Value | Count | Frequency (%) |
| False | 3223 | |
| True | 111 | 3.3% |
compliancestatus
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 126 |
| Missing (%) | 3.8% |
| Memory size | 26.2 KiB |
| Compliant | |
|---|---|
| Non-Compliant | 2 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 9.0024938 |
| Min length | 9 |
Characters and Unicode
| Total characters | 28880 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Compliant |
|---|---|
| 2nd row | Compliant |
| 3rd row | Compliant |
| 4th row | Compliant |
| 5th row | Compliant |
Common Values
| Value | Count | Frequency (%) |
| Compliant | 3206 | |
| Non-Compliant | 2 | 0.1% |
| (Missing) | 126 | 3.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| compliant | 3206 | |
| non-compliant | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 3210 | |
| n | 3210 | |
| C | 3208 | |
| m | 3208 | |
| p | 3208 | |
| l | 3208 | |
| i | 3208 | |
| a | 3208 | |
| t | 3208 | |
| N | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25668 | |
| Uppercase Letter | 3210 | 11.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3210 | |
| n | 3210 | |
| m | 3208 | |
| p | 3208 | |
| l | 3208 | |
| i | 3208 | |
| a | 3208 | |
| t | 3208 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3208 | |
| N | 2 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28878 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3210 | |
| n | 3210 | |
| C | 3208 | |
| m | 3208 | |
| p | 3208 | |
| l | 3208 | |
| i | 3208 | |
| a | 3208 | |
| t | 3208 | |
| N | 2 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28880 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 3210 | |
| n | 3210 | |
| C | 3208 | |
| m | 3208 | |
| p | 3208 | |
| l | 3208 | |
| i | 3208 | |
| a | 3208 | |
| t | 3208 | |
| N | 2 | < 0.1% |
totalghgemissions
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 2790 |
|---|---|
| Distinct (%) | 83.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120.14511 |
| Minimum | 0 |
|---|---|
| Maximum | 16870.98 |
| Zeros | 9 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3.9 |
| Q1 | 9.6625 |
| median | 34.125 |
| Q3 | 94.0175 |
| 95-th percentile | 392.4785 |
| Maximum | 16870.98 |
| Range | 16870.98 |
| Interquartile range (IQR) | 84.355 |
Descriptive statistics
| Standard deviation | 541.24136 |
|---|---|
| Coefficient of variation (CV) | 4.5048972 |
| Kurtosis | 471.03548 |
| Mean | 120.14511 |
| Median Absolute Deviation (MAD) | 28.04 |
| Skewness | 19.410543 |
| Sum | 400563.79 |
| Variance | 292942.21 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 9 | 0.3% |
| 3.95 | 7 | 0.2% |
| 4.2 | 6 | 0.2% |
| 6.18 | 5 | 0.1% |
| 3.54 | 5 | 0.1% |
| 4.52 | 5 | 0.1% |
| 4.43 | 5 | 0.1% |
| 4.8 | 5 | 0.1% |
| 5.46 | 5 | 0.1% |
| 9.29 | 5 | 0.1% |
| Other values (2780) | 3277 |
| Value | Count | Frequency (%) |
| 0 | 9 | |
| 0.4 | 1 | < 0.1% |
| 0.63 | 1 | < 0.1% |
| 0.68 | 1 | < 0.1% |
| 0.75 | 1 | < 0.1% |
| 0.79 | 1 | < 0.1% |
| 0.81 | 1 | < 0.1% |
| 0.82 | 1 | < 0.1% |
| 0.86 | 1 | < 0.1% |
| 0.87 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 16870.98 | 1 | |
| 12307.16 | 1 | |
| 11140.56 | 1 | |
| 10734.57 | 1 | |
| 8145.52 | 1 | |
| 6330.91 | 1 | |
| 4906.33 | 1 | |
| 3995.45 | 1 | |
| 3768.66 | 1 | |
| 3278.11 | 1 |
zipcode
Real number (ℝ)
| Distinct | 60 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98117.005 |
| Minimum | 98006 |
|---|---|
| Maximum | 98272 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 98006 |
|---|---|
| 5-th percentile | 98101 |
| Q1 | 98105 |
| median | 98115 |
| Q3 | 98122 |
| 95-th percentile | 98144 |
| Maximum | 98272 |
| Range | 266 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 18.659486 |
|---|---|
| Coefficient of variation (CV) | 0.00019017586 |
| Kurtosis | 10.444809 |
| Mean | 98117.005 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.994031 |
| Sum | 3.2712209 × 108 |
| Variance | 348.17641 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 98109 | 292 | 8.8% |
| 98104 | 246 | 7.4% |
| 98122 | 239 | 7.2% |
| 98101 | 226 | 6.8% |
| 98105 | 186 | 5.6% |
| 98121 | 185 | 5.5% |
| 98134 | 184 | 5.5% |
| 98102 | 167 | 5.0% |
| 98119 | 165 | 4.9% |
| 98103 | 160 | 4.8% |
| Other values (50) | 1284 |
| Value | Count | Frequency (%) |
| 98006 | 1 | |
| 98011 | 1 | |
| 98012 | 1 | |
| 98013 | 2 | |
| 98020 | 1 | |
| 98028 | 1 | |
| 98033 | 1 | |
| 98040 | 1 | |
| 98053 | 1 | |
| 98070 | 1 |
| Value | Count | Frequency (%) |
| 98272 | 1 | < 0.1% |
| 98204 | 1 | < 0.1% |
| 98199 | 70 | |
| 98198 | 1 | < 0.1% |
| 98195 | 10 | 0.3% |
| 98191 | 1 | < 0.1% |
| 98185 | 1 | < 0.1% |
| 98181 | 1 | < 0.1% |
| 98178 | 4 | 0.1% |
| 98177 | 2 | 0.1% |
latitude
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 2848 |
|---|---|
| Distinct (%) | 85.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.624195 |
| Minimum | 47.49917 |
|---|---|
| Maximum | 47.73387 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 47.49917 |
|---|---|
| 5-th percentile | 47.541602 |
| Q1 | 47.60012 |
| median | 47.618835 |
| Q3 | 47.657232 |
| 95-th percentile | 47.713091 |
| Maximum | 47.73387 |
| Range | 0.2347 |
| Interquartile range (IQR) | 0.0571125 |
Descriptive statistics
| Standard deviation | 0.047823298 |
|---|---|
| Coefficient of variation (CV) | 0.0010041807 |
| Kurtosis | -0.14467741 |
| Mean | 47.624195 |
| Median Absolute Deviation (MAD) | 0.028415 |
| Skewness | 0.13760485 |
| Sum | 158779.07 |
| Variance | 0.0022870678 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 47.66246 | 9 | 0.3% |
| 47.61598 | 7 | 0.2% |
| 47.62208 | 6 | 0.2% |
| 47.62395 | 5 | 0.1% |
| 47.52549 | 5 | 0.1% |
| 47.61543 | 5 | 0.1% |
| 47.6239 | 4 | 0.1% |
| 47.52254 | 4 | 0.1% |
| 47.5829 | 4 | 0.1% |
| 47.61048 | 4 | 0.1% |
| Other values (2838) | 3281 |
| Value | Count | Frequency (%) |
| 47.49917 | 1 | |
| 47.50061895 | 1 | |
| 47.50224 | 1 | |
| 47.50959 | 1 | |
| 47.5097 | 1 | |
| 47.51018 | 1 | |
| 47.51042 | 1 | |
| 47.51098 | 1 | |
| 47.51104 | 1 | |
| 47.51127 | 2 |
| Value | Count | Frequency (%) |
| 47.73387 | 1 | |
| 47.73375 | 1 | |
| 47.73368 | 1 | |
| 47.7336 | 1 | |
| 47.73357 | 1 | |
| 47.73351 | 1 | |
| 47.73331 | 1 | |
| 47.73316 | 1 | |
| 47.73315 | 1 | |
| 47.73279 | 1 |
longitude
Real number (ℝ)
| Distinct | 2632 |
|---|---|
| Distinct (%) | 78.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -122.33475 |
| Minimum | -122.41425 |
|---|---|
| Maximum | -122.22097 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 3334 |
| Negative (%) | 100.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | -122.41425 |
|---|---|
| 5-th percentile | -122.38651 |
| Q1 | -122.35053 |
| median | -122.33248 |
| Q3 | -122.31943 |
| 95-th percentile | -122.28981 |
| Maximum | -122.22097 |
| Range | 0.1932841 |
| Interquartile range (IQR) | 0.031095 |
Descriptive statistics
| Standard deviation | 0.027164202 |
|---|---|
| Coefficient of variation (CV) | -0.00022204813 |
| Kurtosis | 0.26755314 |
| Mean | -122.33475 |
| Median Absolute Deviation (MAD) | 0.015075 |
| Skewness | -0.13665451 |
| Sum | -407864.05 |
| Variance | 0.00073789385 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -122.29898 | 8 | 0.2% |
| -122.35398 | 7 | 0.2% |
| -122.32468 | 6 | 0.2% |
| -122.33369 | 6 | 0.2% |
| -122.32592 | 5 | 0.1% |
| -122.32417 | 5 | 0.1% |
| -122.33379 | 5 | 0.1% |
| -122.33064 | 5 | 0.1% |
| -122.31769 | 5 | 0.1% |
| -122.3255 | 4 | 0.1% |
| Other values (2622) | 3278 |
| Value | Count | Frequency (%) |
| -122.41425 | 1 | |
| -122.41182 | 1 | |
| -122.41178 | 1 | |
| -122.41169 | 1 | |
| -122.41037 | 1 | |
| -122.41036 | 1 | |
| -122.41031 | 1 | |
| -122.40976 | 1 | |
| -122.40974 | 1 | |
| -122.40901 | 1 |
| Value | Count | Frequency (%) |
| -122.2209659 | 1 | |
| -122.25864 | 1 | |
| -122.26028 | 1 | |
| -122.26034 | 1 | |
| -122.26166 | 2 | |
| -122.26172 | 1 | |
| -122.26177 | 1 | |
| -122.2618 | 1 | |
| -122.26216 | 1 | |
| -122.26223 | 1 |
age
Real number (ℝ)
| Distinct | 113 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.266047 |
| Minimum | 8 |
|---|---|
| Maximum | 123 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 26 |
| median | 48 |
| Q3 | 74.75 |
| 95-th percentile | 115 |
| Maximum | 123 |
| Range | 115 |
| Interquartile range (IQR) | 48.75 |
Descriptive statistics
| Standard deviation | 33.014307 |
|---|---|
| Coefficient of variation (CV) | 0.60837871 |
| Kurtosis | -0.86255203 |
| Mean | 54.266047 |
| Median Absolute Deviation (MAD) | 24 |
| Skewness | 0.54463883 |
| Sum | 180923 |
| Variance | 1089.9445 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 71 | 2.1% |
| 9 | 67 | 2.0% |
| 34 | 65 | 1.9% |
| 15 | 65 | 1.9% |
| 35 | 64 | 1.9% |
| 24 | 64 | 1.9% |
| 55 | 63 | 1.9% |
| 22 | 59 | 1.8% |
| 21 | 59 | 1.8% |
| 33 | 59 | 1.8% |
| Other values (103) | 2698 |
| Value | Count | Frequency (%) |
| 8 | 35 | |
| 9 | 67 | |
| 10 | 50 | |
| 11 | 35 | |
| 12 | 15 | 0.4% |
| 13 | 24 | 0.7% |
| 14 | 41 | |
| 15 | 65 | |
| 16 | 42 | |
| 17 | 45 |
| Value | Count | Frequency (%) |
| 123 | 53 | |
| 122 | 8 | 0.2% |
| 121 | 11 | 0.3% |
| 120 | 3 | 0.1% |
| 119 | 14 | 0.4% |
| 118 | 9 | 0.3% |
| 117 | 18 | 0.5% |
| 116 | 31 | |
| 115 | 27 | |
| 114 | 32 |
| Unnamed: 0 | osebuildingid | councildistrictcode | numberofbuildings | numberoffloors | propertygfatotal | propertygfaparking | propertygfabuilding_s | largestpropertyusetypegfa | energystarscore | siteeui_kbtu_sf | siteeuiwn_kbtu_sf | sourceeui_kbtu_sf | sourceeuiwn_kbtu_sf | siteenergyuse_kbtu | siteenergyusewn_kbtu | totalghgemissions | zipcode | latitude | longitude | age | buildingtype | primarypropertytype | neighborhood | steamuse | electricity | naturalgas | defaultdata | compliancestatus | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Unnamed: 0 | 1.000 | 0.998 | -0.147 | 0.005 | -0.010 | -0.289 | -0.194 | -0.278 | -0.271 | 0.085 | -0.182 | -0.181 | -0.182 | -0.181 | -0.274 | -0.276 | -0.227 | 0.095 | 0.099 | 0.125 | -0.147 | 0.207 | 0.235 | 0.190 | 0.259 | 0.040 | 0.121 | 0.128 | 0.000 |
| osebuildingid | 0.998 | 1.000 | -0.146 | 0.005 | -0.012 | -0.289 | -0.194 | -0.278 | -0.270 | 0.085 | -0.181 | -0.180 | -0.180 | -0.180 | -0.272 | -0.275 | -0.226 | 0.096 | 0.099 | 0.126 | -0.147 | 0.203 | 0.253 | 0.157 | 0.209 | 0.025 | 0.133 | 0.063 | 0.000 |
| councildistrictcode | -0.147 | -0.146 | 1.000 | -0.039 | 0.335 | 0.155 | 0.153 | 0.145 | 0.127 | 0.074 | 0.092 | 0.078 | 0.109 | 0.099 | 0.146 | 0.138 | 0.120 | -0.194 | 0.512 | -0.349 | -0.001 | 0.149 | 0.252 | 0.880 | 0.214 | 0.031 | 0.142 | 0.098 | 0.000 |
| numberofbuildings | 0.005 | 0.005 | -0.039 | 1.000 | -0.042 | 0.102 | -0.004 | 0.103 | 0.118 | 0.028 | 0.043 | 0.035 | 0.043 | 0.036 | 0.113 | 0.103 | 0.098 | 0.037 | 0.032 | 0.054 | -0.046 | 0.238 | 0.153 | 0.048 | 0.081 | 0.000 | 0.000 | 0.000 | 0.000 |
| numberoffloors | -0.010 | -0.012 | 0.335 | -0.042 | 1.000 | 0.442 | 0.262 | 0.434 | 0.415 | 0.126 | 0.022 | -0.004 | 0.096 | 0.081 | 0.289 | 0.274 | 0.173 | -0.230 | 0.064 | -0.114 | -0.293 | 0.246 | 0.263 | 0.137 | 0.263 | 0.000 | 0.047 | 0.000 | 0.000 |
| propertygfatotal | -0.289 | -0.289 | 0.155 | 0.102 | 0.442 | 1.000 | 0.346 | 0.983 | 0.930 | 0.082 | 0.185 | 0.158 | 0.209 | 0.185 | 0.757 | 0.741 | 0.580 | -0.092 | -0.057 | -0.021 | -0.313 | 0.144 | 0.173 | 0.060 | 0.146 | 0.044 | 0.021 | 0.000 | 0.150 |
| propertygfaparking | -0.194 | -0.194 | 0.153 | -0.004 | 0.262 | 0.346 | 1.000 | 0.222 | 0.272 | 0.014 | 0.198 | 0.177 | 0.246 | 0.229 | 0.305 | 0.292 | 0.207 | -0.125 | 0.015 | -0.053 | -0.239 | 0.052 | 0.156 | 0.061 | 0.084 | 0.000 | 0.016 | 0.000 | 0.000 |
| propertygfabuilding_s | -0.278 | -0.278 | 0.145 | 0.103 | 0.434 | 0.983 | 0.222 | 1.000 | 0.928 | 0.082 | 0.161 | 0.136 | 0.178 | 0.155 | 0.742 | 0.726 | 0.576 | -0.080 | -0.066 | -0.016 | -0.285 | 0.166 | 0.190 | 0.050 | 0.122 | 0.050 | 0.017 | 0.000 | 0.162 |
| largestpropertyusetypegfa | -0.271 | -0.270 | 0.127 | 0.118 | 0.415 | 0.930 | 0.272 | 0.928 | 1.000 | 0.094 | 0.120 | 0.097 | 0.126 | 0.104 | 0.722 | 0.708 | 0.566 | -0.053 | -0.049 | -0.012 | -0.291 | 0.148 | 0.226 | 0.050 | 0.143 | 0.055 | 0.018 | 0.000 | 0.168 |
| energystarscore | 0.085 | 0.085 | 0.074 | 0.028 | 0.126 | 0.082 | 0.014 | 0.082 | 0.094 | 1.000 | -0.447 | -0.447 | -0.515 | -0.524 | -0.174 | -0.174 | -0.099 | -0.002 | 0.086 | -0.035 | -0.078 | 0.119 | 0.121 | 0.056 | 0.000 | 0.000 | 0.102 | 0.110 | 1.000 |
| siteeui_kbtu_sf | -0.182 | -0.181 | 0.092 | 0.043 | 0.022 | 0.185 | 0.198 | 0.161 | 0.120 | -0.447 | 1.000 | 0.987 | 0.869 | 0.867 | 0.707 | 0.699 | 0.707 | -0.131 | -0.083 | 0.045 | 0.063 | 0.137 | 0.280 | 0.057 | 0.135 | 0.020 | 0.148 | 0.053 | 1.000 |
| siteeuiwn_kbtu_sf | -0.181 | -0.180 | 0.078 | 0.035 | -0.004 | 0.158 | 0.177 | 0.136 | 0.097 | -0.447 | 0.987 | 1.000 | 0.839 | 0.861 | 0.685 | 0.702 | 0.704 | -0.126 | -0.086 | 0.044 | 0.086 | 0.137 | 0.271 | 0.054 | 0.118 | 0.000 | 0.176 | 0.059 | 0.000 |
| sourceeui_kbtu_sf | -0.182 | -0.180 | 0.109 | 0.043 | 0.096 | 0.209 | 0.246 | 0.178 | 0.126 | -0.515 | 0.869 | 0.839 | 1.000 | 0.986 | 0.636 | 0.618 | 0.463 | -0.107 | -0.052 | 0.028 | -0.061 | 0.113 | 0.243 | 0.027 | 0.036 | 0.000 | 0.068 | 0.025 | 0.000 |
| sourceeuiwn_kbtu_sf | -0.181 | -0.180 | 0.099 | 0.036 | 0.081 | 0.185 | 0.229 | 0.155 | 0.104 | -0.524 | 0.867 | 0.861 | 0.986 | 1.000 | 0.618 | 0.626 | 0.455 | -0.107 | -0.053 | 0.029 | -0.040 | 0.114 | 0.246 | 0.022 | 0.034 | 0.000 | 0.069 | 0.027 | 0.000 |
| siteenergyuse_kbtu | -0.274 | -0.272 | 0.146 | 0.113 | 0.289 | 0.757 | 0.305 | 0.742 | 0.722 | -0.174 | 0.707 | 0.685 | 0.636 | 0.618 | 1.000 | 0.986 | 0.873 | -0.122 | -0.095 | 0.020 | -0.160 | 0.156 | 0.276 | 0.000 | 0.127 | 0.000 | 0.023 | 0.000 | 0.000 |
| siteenergyusewn_kbtu | -0.276 | -0.275 | 0.138 | 0.103 | 0.274 | 0.741 | 0.292 | 0.726 | 0.708 | -0.174 | 0.699 | 0.702 | 0.618 | 0.626 | 0.986 | 1.000 | 0.871 | -0.118 | -0.097 | 0.019 | -0.148 | 0.139 | 0.298 | 0.041 | 0.210 | 0.000 | 0.048 | 0.000 | 0.000 |
| totalghgemissions | -0.227 | -0.226 | 0.120 | 0.098 | 0.173 | 0.580 | 0.207 | 0.576 | 0.566 | -0.099 | 0.707 | 0.704 | 0.463 | 0.455 | 0.873 | 0.871 | 1.000 | -0.129 | -0.113 | 0.025 | -0.025 | 0.126 | 0.259 | 0.000 | 0.198 | 0.000 | 0.034 | 0.000 | 0.000 |
| zipcode | 0.095 | 0.096 | -0.194 | 0.037 | -0.230 | -0.092 | -0.125 | -0.080 | -0.053 | -0.002 | -0.131 | -0.126 | -0.107 | -0.107 | -0.122 | -0.118 | -0.129 | 1.000 | -0.046 | 0.008 | -0.087 | 0.053 | 0.076 | 0.255 | 0.150 | 0.000 | 0.087 | 0.065 | 0.000 |
| latitude | 0.099 | 0.099 | 0.512 | 0.032 | 0.064 | -0.057 | 0.015 | -0.066 | -0.049 | 0.086 | -0.083 | -0.086 | -0.052 | -0.053 | -0.095 | -0.097 | -0.113 | -0.046 | 1.000 | -0.026 | -0.134 | 0.151 | 0.215 | 0.589 | 0.301 | 0.047 | 0.154 | 0.141 | 0.000 |
| longitude | 0.125 | 0.126 | -0.349 | 0.054 | -0.114 | -0.021 | -0.053 | -0.016 | -0.012 | -0.035 | 0.045 | 0.044 | 0.028 | 0.029 | 0.020 | 0.019 | 0.025 | 0.008 | -0.026 | 1.000 | 0.050 | 0.126 | 0.148 | 0.490 | 0.166 | 0.016 | 0.062 | 0.198 | 0.000 |
| age | -0.147 | -0.147 | -0.001 | -0.046 | -0.293 | -0.313 | -0.239 | -0.285 | -0.291 | -0.078 | 0.063 | 0.086 | -0.061 | -0.040 | -0.160 | -0.148 | -0.025 | -0.087 | -0.134 | 0.050 | 1.000 | 0.160 | 0.187 | 0.176 | 0.155 | 0.017 | 0.341 | 0.051 | 0.031 |
| buildingtype | 0.207 | 0.203 | 0.149 | 0.238 | 0.246 | 0.144 | 0.052 | 0.166 | 0.148 | 0.119 | 0.137 | 0.137 | 0.113 | 0.114 | 0.156 | 0.139 | 0.126 | 0.053 | 0.151 | 0.126 | 0.160 | 1.000 | 0.732 | 0.200 | 0.195 | 0.177 | 0.280 | 0.703 | 0.000 |
| primarypropertytype | 0.235 | 0.253 | 0.252 | 0.153 | 0.263 | 0.173 | 0.156 | 0.190 | 0.226 | 0.121 | 0.280 | 0.271 | 0.243 | 0.246 | 0.276 | 0.298 | 0.259 | 0.076 | 0.215 | 0.148 | 0.187 | 0.732 | 1.000 | 0.244 | 0.295 | 0.159 | 0.345 | 0.604 | 0.000 |
| neighborhood | 0.190 | 0.157 | 0.880 | 0.048 | 0.137 | 0.060 | 0.061 | 0.050 | 0.050 | 0.056 | 0.057 | 0.054 | 0.027 | 0.022 | 0.000 | 0.041 | 0.000 | 0.255 | 0.589 | 0.490 | 0.176 | 0.200 | 0.244 | 1.000 | 0.288 | 0.049 | 0.160 | 0.149 | 0.000 |
| steamuse | 0.259 | 0.209 | 0.214 | 0.081 | 0.263 | 0.146 | 0.084 | 0.122 | 0.143 | 0.000 | 0.135 | 0.118 | 0.036 | 0.034 | 0.127 | 0.210 | 0.198 | 0.150 | 0.301 | 0.166 | 0.155 | 0.195 | 0.295 | 0.288 | 1.000 | 0.000 | 0.012 | 0.017 | 0.000 |
| electricity | 0.040 | 0.025 | 0.031 | 0.000 | 0.000 | 0.044 | 0.000 | 0.050 | 0.055 | 0.000 | 0.020 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.047 | 0.016 | 0.017 | 0.177 | 0.159 | 0.049 | 0.000 | 1.000 | 0.026 | 0.000 | 0.474 |
| naturalgas | 0.121 | 0.133 | 0.142 | 0.000 | 0.047 | 0.021 | 0.016 | 0.017 | 0.018 | 0.102 | 0.148 | 0.176 | 0.068 | 0.069 | 0.023 | 0.048 | 0.034 | 0.087 | 0.154 | 0.062 | 0.341 | 0.280 | 0.345 | 0.160 | 0.012 | 0.026 | 1.000 | 0.033 | 0.008 |
| defaultdata | 0.128 | 0.063 | 0.098 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.110 | 0.053 | 0.059 | 0.025 | 0.027 | 0.000 | 0.000 | 0.000 | 0.065 | 0.141 | 0.198 | 0.051 | 0.703 | 0.604 | 0.149 | 0.017 | 0.000 | 0.033 | 1.000 | 1.000 |
| compliancestatus | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.150 | 0.000 | 0.162 | 0.168 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.031 | 0.000 | 0.000 | 0.000 | 0.000 | 0.474 | 0.008 | 1.000 | 1.000 |
| Unnamed: 0 | osebuildingid | buildingtype | primarypropertytype | taxparcelidentificationnumber | councildistrictcode | neighborhood | numberofbuildings | numberoffloors | propertygfatotal | propertygfaparking | propertygfabuilding_s | listofallpropertyusetypes | largestpropertyusetype | largestpropertyusetypegfa | energystarscore | siteeui_kbtu_sf | siteeuiwn_kbtu_sf | sourceeui_kbtu_sf | sourceeuiwn_kbtu_sf | siteenergyuse_kbtu | siteenergyusewn_kbtu | steamuse | electricity | naturalgas | defaultdata | compliancestatus | totalghgemissions | zipcode | latitude | longitude | age | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1 | NonResidential | Hotel | 0659000030 | 7 | DOWNTOWN | 1.0 | 12 | 88434 | 0 | 88434 | Hotel | Hotel | 88434.0 | 60.0 | 81.699997 | 84.300003 | 182.500000 | 189.000000 | 7226362.5 | 7456910.0 | True | True | True | False | Compliant | 249.98 | 98101.0 | 47.61220 | -122.33799 | 96 |
| 1 | 1 | 2 | NonResidential | Hotel | 0659000220 | 7 | DOWNTOWN | 1.0 | 11 | 103566 | 15064 | 88502 | Hotel, Parking, Restaurant | Hotel | 83880.0 | 61.0 | 94.800003 | 97.900002 | 176.100006 | 179.399994 | 8387933.0 | 8664479.0 | False | True | True | False | Compliant | 295.86 | 98101.0 | 47.61317 | -122.33393 | 27 |
| 2 | 2 | 3 | NonResidential | Hotel | 0659000475 | 7 | DOWNTOWN | 1.0 | 41 | 956110 | 196718 | 759392 | Hotel | Hotel | 756493.0 | 43.0 | 96.000000 | 97.699997 | 241.899994 | 244.100006 | 72587024.0 | 73937112.0 | True | True | True | False | Compliant | 2089.28 | 98101.0 | 47.61393 | -122.33810 | 54 |
| 3 | 3 | 5 | NonResidential | Hotel | 0659000640 | 7 | DOWNTOWN | 1.0 | 10 | 61320 | 0 | 61320 | Hotel | Hotel | 61320.0 | 56.0 | 110.800003 | 113.300003 | 216.199997 | 224.000000 | 6794584.0 | 6946800.5 | True | True | True | False | Compliant | 286.43 | 98101.0 | 47.61412 | -122.33664 | 97 |
| 4 | 4 | 8 | NonResidential | Hotel | 0659000970 | 7 | DOWNTOWN | 1.0 | 18 | 175580 | 62000 | 113580 | Hotel, Parking, Swimming Pool | Hotel | 123445.0 | 75.0 | 114.800003 | 118.699997 | 211.399994 | 215.600006 | 14172606.0 | 14656503.0 | False | True | True | False | Compliant | 505.01 | 98121.0 | 47.61375 | -122.34047 | 43 |
| 5 | 5 | 9 | Nonresidential COS | Other | 0660000560 | 7 | DOWNTOWN | 1.0 | 2 | 97288 | 37198 | 60090 | Police Station | Police Station | 88830.0 | NaN | 136.100006 | 141.600006 | 316.299988 | 320.500000 | 12086616.0 | 12581712.0 | False | True | True | False | Compliant | 301.81 | 98101.0 | 47.61623 | -122.33657 | 24 |
| 6 | 6 | 10 | NonResidential | Hotel | 0660000825 | 7 | DOWNTOWN | 1.0 | 11 | 83008 | 0 | 83008 | Hotel | Hotel | 81352.0 | 27.0 | 70.800003 | 74.500000 | 146.600006 | 154.699997 | 5758795.0 | 6062767.5 | False | True | True | False | Compliant | 176.14 | 98101.0 | 47.61390 | -122.33283 | 97 |
| 7 | 7 | 11 | NonResidential | Other | 0660000955 | 7 | DOWNTOWN | 1.0 | 8 | 102761 | 0 | 102761 | Other - Entertainment/Public Assembly | Other - Entertainment/Public Assembly | 102761.0 | NaN | 61.299999 | 68.800003 | 141.699997 | 152.300003 | 6298131.5 | 7067881.5 | True | True | True | False | Compliant | 221.51 | 98101.0 | 47.61327 | -122.33136 | 97 |
| 8 | 8 | 12 | NonResidential | Hotel | 0939000080 | 7 | DOWNTOWN | 1.0 | 15 | 163984 | 0 | 163984 | Hotel | Hotel | 163984.0 | 43.0 | 83.699997 | 86.599998 | 180.899994 | 187.199997 | 13723820.0 | 14194054.0 | False | True | True | False | Compliant | 392.16 | 98104.0 | 47.60294 | -122.33263 | 119 |
| 9 | 9 | 13 | Multifamily MR (5-9) | Mid-Rise Multifamily | 0939000105 | 7 | DOWNTOWN | 1.0 | 6 | 63712 | 1496 | 62216 | Multifamily Housing | Multifamily Housing | 56132.0 | 1.0 | 81.500000 | 85.599998 | 182.699997 | 187.399994 | 4573777.0 | 4807679.5 | True | True | True | False | Compliant | 151.12 | 98104.0 | 47.60284 | -122.33184 | 113 |
| Unnamed: 0 | osebuildingid | buildingtype | primarypropertytype | taxparcelidentificationnumber | councildistrictcode | neighborhood | numberofbuildings | numberoffloors | propertygfatotal | propertygfaparking | propertygfabuilding_s | listofallpropertyusetypes | largestpropertyusetype | largestpropertyusetypegfa | energystarscore | siteeui_kbtu_sf | siteeuiwn_kbtu_sf | sourceeui_kbtu_sf | sourceeuiwn_kbtu_sf | siteenergyuse_kbtu | siteenergyusewn_kbtu | steamuse | electricity | naturalgas | defaultdata | compliancestatus | totalghgemissions | zipcode | latitude | longitude | age | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3324 | 3366 | 50210 | Nonresidential COS | Office | 2425039137 | 7 | MAGNOLIA / QUEEN ANNE | 1.0 | 1 | 13661 | 0 | 13661 | Office | Office | 13661.0 | 75.0 | 36.799999 | 40.900002 | 115.500000 | 128.399994 | 5.026677e+05 | 5.585251e+05 | False | True | False | True | NaN | 3.50 | 98119.2 | 47.63572 | -122.37525 | 71 |
| 3325 | 3367 | 50212 | Nonresidential COS | Other | 2925049087 | 3 | EAST | 1.0 | 1 | 23445 | 0 | 23445 | Other - Recreation | Other - Recreation | 23445.0 | NaN | 254.899994 | 286.500000 | 380.100006 | 413.200012 | 5.976246e+06 | 6.716330e+06 | False | True | True | False | Compliant | 259.22 | 98106.0 | 47.63228 | -122.31574 | 111 |
| 3326 | 3368 | 50219 | Nonresidential COS | Mixed Use Property | 7544800245 | 3 | CENTRAL | 1.0 | 1 | 20050 | 0 | 20050 | Fitness Center/Health Club/Gym, Office, Other - Recreation, Other - Technology/Science | Other - Recreation | 8108.0 | NaN | 90.400002 | 99.400002 | 175.199997 | 184.600006 | 1.813404e+06 | 1.993137e+06 | False | True | True | False | Compliant | 60.81 | 98126.4 | 47.60775 | -122.30225 | 29 |
| 3327 | 3369 | 50220 | Nonresidential COS | Office | 4154300585 | 2 | SOUTHEAST | 1.0 | 1 | 15398 | 0 | 15398 | Office | Office | 15398.0 | 93.0 | 25.200001 | 26.900000 | 64.099998 | 66.699997 | 3.878100e+05 | 4.141724e+05 | False | True | True | True | NaN | 7.79 | 98120.6 | 47.56440 | -122.27813 | 63 |
| 3328 | 3370 | 50221 | Nonresidential COS | Other | 2524039059 | 1 | DELRIDGE | 1.0 | 1 | 18261 | 0 | 18261 | Other - Recreation | Other - Recreation | 18261.0 | NaN | 51.000000 | 56.200001 | 126.000000 | 136.600006 | 9.320821e+05 | 1.025432e+06 | False | True | True | False | Compliant | 20.33 | 98126.0 | 47.54067 | -122.37441 | 41 |
| 3329 | 3371 | 50222 | Nonresidential COS | Office | 1624049080 | 2 | GREATER DUWAMISH | 1.0 | 1 | 12294 | 0 | 12294 | Office | Office | 12294.0 | 46.0 | 69.099998 | 76.699997 | 161.699997 | 176.100006 | 8.497457e+05 | 9.430032e+05 | False | True | True | True | NaN | 20.94 | 98126.0 | 47.56722 | -122.31154 | 33 |
| 3330 | 3372 | 50223 | Nonresidential COS | Other | 3558300000 | 2 | DOWNTOWN | 1.0 | 1 | 16000 | 0 | 16000 | Other - Recreation | Other - Recreation | 16000.0 | NaN | 59.400002 | 65.900002 | 114.199997 | 118.900002 | 9.502762e+05 | 1.053706e+06 | False | True | True | False | Compliant | 32.17 | 98113.0 | 47.59625 | -122.32283 | 19 |
| 3331 | 3373 | 50224 | Nonresidential COS | Other | 1794501150 | 7 | MAGNOLIA / QUEEN ANNE | 1.0 | 1 | 13157 | 0 | 13157 | Fitness Center/Health Club/Gym, Other - Recreation, Swimming Pool | Other - Recreation | 7583.0 | NaN | 438.200012 | 460.100006 | 744.799988 | 767.799988 | 5.765898e+06 | 6.053764e+06 | False | True | True | False | Compliant | 223.54 | 98112.0 | 47.63644 | -122.35784 | 49 |
| 3332 | 3374 | 50225 | Nonresidential COS | Mixed Use Property | 7883603155 | 1 | GREATER DUWAMISH | 1.0 | 1 | 14101 | 0 | 14101 | Fitness Center/Health Club/Gym, Food Service, Office, Other - Recreation, Pre-school/Daycare | Other - Recreation | 6601.0 | NaN | 51.000000 | 55.500000 | 105.300003 | 110.800003 | 7.194712e+05 | 7.828413e+05 | False | True | True | False | Compliant | 22.11 | 98108.0 | 47.52832 | -122.32431 | 34 |
| 3333 | 3375 | 50226 | Nonresidential COS | Mixed Use Property | 7857002030 | 2 | GREATER DUWAMISH | 1.0 | 1 | 18258 | 0 | 18258 | Fitness Center/Health Club/Gym, Food Service, Office, Other - Recreation, Pre-school/Daycare | Other - Recreation | 8271.0 | NaN | 63.099998 | 70.900002 | 115.800003 | 123.900002 | 1.152896e+06 | 1.293722e+06 | False | True | True | False | Compliant | 41.27 | 98118.7 | 47.53939 | -122.29536 | 85 |